(19) 



J 



(12) 



(43) Date of publication: 

29.04.1998 Bulletin 1998/18 

(21) Application number: 97116453.8 

(22) Date of filing: 22.09.1 997 



Europaisches Patentamt 
European Patent Office 
Office europeendes brevets (11) EP 0 838 945 A2 

EUROPEAN PATENT APPLICATION 

(51) Int. a 6 : H04N5/44 



(84) Designated Contracting States: 


* Ma, Yue 


AT BE CH DE DK ES Fl FR GB GR IE IT LI LU MC 


Plainsboro, New Jersey 08536 (US) 


NL PT SE 


• Tomkins, Andrew 




Pittsburgh, Pennsylvania 15218 (US) 


(30) Priority: 25.10.1996 US 736982 


- Zhou, Jian 




Plainsboro, New Jersey 08536 (US) 


(71) Applicant: 


MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. 


(74) Representative: 


Kadoma-shi, Osaka 571 (JP) 


Steil, Christian, Dipl.-lng. et al 




Witte, Welter, Gahlert, 


(72) Inventors: 


Otten & Steil, 


• Lopresti, Daniel P. 


Patentanwalte, 


Hopewell, New Jersey08525 (US) 


Rotebuhlstrasse 1 21 




70178 Stuttgart (DE) 



CM 
< 

a> 

CO 
CO 
CO 

o 

LU 



(54) Video user's environment 

(57) The user (1 02) communicates through a digitiz- 
ing writing surface (26) with the audio/video control 
apparatus (20). An on-screen display (32, 34) is gener- 
ated, providing the user (102) with a user environment 
in which a wide range of different tasks and functions 
can be performed. The digitizing writing surface (26) 
can be incorporated into a hand-held remote control 
unit (24) and the audio/video control apparatus (20) 
may likewise be incorporated into existing home enter- 
tainment or computer equipment. By tapping on the 
writing surface (26) a command bar (32) is presented on 
the screen, allowing the user (102) to select among var- 
ious functions. Included in these functions is. an on- 
screen programming feature (158, 148), allowing the 
user to select programs for viewing or recording by entry 
of user-drawn annotations or commands via the writing 
surface (26). 
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Description 

Background and Summary of the Invention 

5 The present invention relates generally to the control of audio, video and multimedia equipment. More particularly, 
the invention relates to an on-screen user interface for interacting with audio, video and multimedia components using 
a remote control apparatus having a digitized writing surface for entry of hand-drawn instructions by the user. 

Television is on the verge of a revolution. Previously separate computer, communications and consumer electronics 
technologies are converging. This convergence will undoubtedly yield a rich assortment of program content and serv- 

10 ices, although it is by no means clear that a user will be able to navigate through the assortment of choices to find what 
he or she is interested in. For example, future systems are expected to provide both high quality digital, audio and video, 
up to 500 channels of programming, and a variety of on-demand services, including home shopping and banking, inter- 
active games and entertainment, multimedia libraries and full access to the Internet. 

Providing a user interface for a complex system such as this is by no means a simple task. Easy-to-use access to 

15 a complex system - as television is expected to become - simply cannot be accomplished using the numeric keypad 
and forward and reverse buttons on today's hand-held remote controls. In terms of convenience and usability, present 
hand-held remote controls have already reached the point of diminishing returns. Adding more buttons makes these 
systems harder to control, not easier. Some systems today use on-screen display to echo the current operating param- 
eter of a remote control push button as it is being pushed. While pressing the Color Tint button, for example, the con- 

20 ventional system may display a bar graph showing the current tint setting. While this simple user feedback system is 
certainly better than nothing, it by no means solves the more fundamental problem of how to provide intuitive control to 
users of all ages and all nationalities. Also, while the on-screen display of parameters may be viewable in a darkened 
room, the push buttons used to control these parameters may not be visible. Thus the greater the number of push but- 
tons on a hand-held remote, the harder it becomes to locate the correct push button while in a room darkened for opti- 

25 mal viewing. 

Aside from the shortcomings of push button user interface technology, current technology is also deficient in sup- 
porting users that do not have the time or inclination to learn complex system features or users, such as preschool chil- 
dren, who cannot read. The addition of a computer style keyboard for controlling the functions does not help to simplify 
such a system. Moreover, the placement of a keyboard on the family room coffee table appears less acceptable than a 

30 small remote control or digitized writing tablet. 

The present invention takes a fresh approach to the problem. Although the hand-held remote with push buttons 
may still be used, the present invention provides a digitizing writing surface through which the user may enter hand- 
drawn instructions. These instructions can be handwritten text, symbols or even pictures, all of which are written to the 
digitized writing surface using a pen or stylus. Such a means for controlling the system and providing input appeals to 

35 a broader range of users than does a conventional keyboard. Through the mechanism of providing hand-drawn instruc- 
tions, complex systems can be controlled with ease. The user can create his or her own hand<Jrawn instructions 
(words, symbols, pictures, etc.) to represent any desired control function, even complex control functions such as 
instructing the audio/video system to turn on at a certain time and display the user's selected favorite program, or to 
search all available programs to locate those meeting the user's criteria of interest. This hand-drawn input can also 

40 include gestures which are recognized by the system and processed as commands to control various functions of the 
audio/video system. For example, drawing a large "X" over the digitized writing surface could be interpreted as a com* 
mand to turn off the television and/or the audio/video system. Additionally, handwritten symbols or text input can be writ- 
ten to the digitized writing surface and then processed using known handwriting recognition technology as if the 
symbols were typed on a keyboard. Once the handwriting is translated into standard character symbol codes, this input 

45 can be further processed or stored in the system's memory for later use. 

According to one aspect of the invention, the enhanced video user environment comprises an audio/video control 
apparatus that selectively performs predetermined audio/video control functions according to the user's selection or 
instruction. The control apparatus is preferably designed with a port for coupling to a video display apparatus, such as 
a television, or projection system or monitor. The audio/video control apparatus can be packaged separately from the 

so existing audio/video equipment, or it can be incorporated into existing components. A remote control apparatus having 
a digitizing writing surface is provided for entry of hand-drawn instructions by the user. The remote control apparatus 
communicates with the audio/video control apparatus. Alternatively, a full-featured personal digital assistant (PDA) that 
implements TV remote control as one of its programmable functions could also be used as the remote control appara- 
tus. Many commercially available PDAs currently include means for wireless communication, such as an infrared link. 

55 The system further includes a processor that communicates with the audio/video control apparatus, the remote 
control apparatus or both. The processor controls operation of the video display apparatus in accordance with the hand- 
drawn instructions provided through the digitizing writing surface. The processor can be incorporated with the circuitry 
of the audio/video control apparatus, or it can be incorporated with the circuitry of the remote control apparatus. It is 
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also possible to implement the invention using multiple processors, one associated with the audio/video control, and 
another associated with the remote control. The multiple processors work in concert as distributed processors to imple- 
ment the processing functions required by the invention. 

For a more complete understanding of the invention, its objects and advantages, refer to the following specification 
5 and to the accompanying drawings. 

Brief Description of the Drawings 

Figure 1 illustrates a first embodiment of the invention in which the audio/video control apparatus is packaged as a 
10 set top box, suitable for use with a simple television set; 

Figure 2 is another embodiment of the invention in which the audio/video control apparatus is packaged as part of 
a home entertainment system; 

is Figure 3 is a close up perspective view of an exemplary remote control unit with digitizing writing surface; 

Figure 4 is a system block diagram showing the components of the invention together with ©camples of other com- 
ponents of audio/video equipment, illustrating how the invention is interconnected with this equipment; 

20 Figure 5 is a block diagram showing the hardware components of the audio/video control apparatus and remote 
control apparatus; 

Figure 6 is a block diagram of the presently preferred software architecture of the invention; 

25 Figure 7 is a diagram representing a screen snapshot, showing the command bar of the presently preferred user 
interface; 

Figure 8 shows the sign-in panel of the presently preferred user interface; 

30 Figure 9 shows an example of an ink search in the sign-in panel of the preferred user interface; 

Figure 10 illustrates standard television controls available for manipulation through the user interface by selecting 
the TV button on the command bar; 

35 Figure 1 1 illustrates an example of a TV channel search using approximate ink matching; 

Figure 12 shows a TV program schedule as presented through the user interface; 

Figure 13 shows a similar TV program schedule that has been limited to display only certain categories by manip- 
40 ulation through the user interface; 

Figure 14 shows a VCR control function display produced by selecting the VCR button on the command bar; 

Figure 15 shows an example of the video game quick access interface; 

45 " . ■ ' ' ' "'■ '•• . ' • ' ' • ' . '.' 

Figure 16 shows an example of the home shopping access interface; 

Figure 1 7 shows an example of the ink mail (l-mail) user interface; 

so Figure 13 is a flow diagram describing the ink data interpretation that forms part of the recognition system; 

Figure 19 is an entity relationship diagram illustrating the steps that the system performs in searching for a user- 
drawn entry or annotation; 

55 Figure 20 is a functional diagram illustrating the basic edit distance technique used by the preferred embodiment; 
and 

Figure 21 is another functional diagram illustrating how approximate matching may be performed with the edit dis- 
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tance technique. 
Description of the Preferred Embodiment 

5 The present invention may be implemented as an audio/video system having an enhanced video user interface or 
video user environment. Many different implementations are possible. Before proceeding with a detailed description of 
the system in its presently preferred form, an overview of two different implementations will be illustrated and described. 
These are simply examples of how one might implement the invention in a working system. Other systems are of course 
possible. . 

10 Referring to Figure 1 the system of the invention is illustrated in a simple embodiment suitable for use with stan- 
dalone television sets or other less complex home entertainment systems. As illustrated in Figure 1, the invention 
includes an audio/video control unit 20 that is packaged as a set-top box designed for placement atop a television 22. 
The hand-held remote control 24 includes a digitizing writing surface 26 on which the user may enter hand-drawn 
instructions using a suitable pen or stylus 28. A personal digital assistant (PDA) could also be substituted or used in 

15 conjunction with remote control 24 and would include a digitizing writing surface and stylus. The control unit 20 and 
remote control 24 communicate with one another via an infrared link depicted diagrammatically at 30. In this embodi- 
ment, the audio/video control unit includes a port on the rear of the unit (not shown) for coupling to the Video In port of 
television 22. In this way, the television 22 serves as a video display apparatus upon which the video user interface is 
projected. In Figure 1 the video user interface has been shown in reduced detail as including a command bar 32 and a 

20 user interactive panel 34. The command bar 32 and panel 34 are projected onto the television screen (by inclusion of 
appropriate signals) with the existing NTSC video signals generated by the television tuner. Full details of the video user 
interface will be presented below. If desired, the control unit 20 may include a television tuner module suitable for receiv- 
ing and decoding radio frequency television broadcasts via antenna or cable input. The tuner module supplies NTSC 
video signals to the Video In port of the television, bypassing the need to use the internal tuner section of the television. 

25 A more complex home entertainment system is shown in Figure 2. In this embodiment the remote control 24 is 
essentially the same as described in connection with Figure 1 . The control unit 20 may be configured as a rack mount 
unit for inclusion in the home entertainment system, along with other components of audio/video equipment. For illus- 
tration purposes, the home entertainment system depicted here includes a large screen projection television 36, sur- 
round sound speakers 38, subwoofer 40 and multifunction tuner/amplifier 42. The tuner/amplifier has video and audio 

30 inputs to which additional components of audio/video equipment may be connected. Illustrated here is a digital audio 
tape player 44, VCR 46, laser disc player 48 and camcorder 50. These are simply examples of the type of equipment 
that might be used with the present invention. Also included in the illustrated system is a personal computer 52. The 
personal computer may be connected to an Internet service provider. The control unit 20 is shown as a separate com- 
ponent in Figure 2 for illustration purposes. However, it is not necessary to package the control unit 20 as a separate 

35 component as illustrated here. Rather, the control unit may be incorporated into any of the audio/video components, 
including the television itself. 

An enlarged view of the remote control 24 is shown in Figure 3. The presently preferred remote control 24 is housed 
in a hand-held case 54 having generally the same form factor and dimensions as a conventional hand-held remote con- 
trol unit. The remote control includes a conventional numeric keypad 56, VCR and laser disc motion control buttons 58 
40 as well as selected other buttons for providing convenient control of commonly used features. A thumb-operated jog 
shuttle wheel 60 may also be induded for selecting various other system operating functions. Alternatively, a jog shuttle 
dial may be used in place of the thumb operated jog shuttle. 

The remote control 24 includes a digitizing writing surface 26 that is designed to receive hand-drawn input through 
a pen or stylus 28. If desired; the digitizing writing surface can be hingedly attached to the case 54, allowing the writing 
45 surface to be flipped up to reveal additional push buttons beneath. The digitizing writing surface 26 of the preferred 
embodiment is a passive screen that accepts pen stroke input (according to the ink data type described below) without 
providing visual feeoback on the writing surface itself. According to this embodiment, the visual feedback appears on 
the video screen. One skilled in the art will also appreciate that digitizing writing surface 26 may be embodied in a sep- 
arate tablet unit which can be placed upon a fixed surface, such as a table, allowing the tablet to be written to more corn- 
so fortably. Alternatively, the digitizing writing surface may be implemented as an active screen that not only accepts pen 
stroke input but also includes a writable display. The active screen may be backlit so that it may be viewed in the dark. 

An overview of the presently preferred system is shown in Figure 4. Specifically, Figure 4 illustrates the control unit 
20 and remote control 24 previously described. The control unit 20 includes a port 62 for coupling to a video display 
apparatus 64. As previously discussed, the display apparatus may be a television set or television monitor, or it may be 
55 a flat panel display, a projection system or a computer monitor. In most home entertainment systems the display func- 
tion is provided by the television. 

The audio/video control 20 may also be coupled to other equipment such as VCR 46, laser disc player 48 and mul- 
timedia computer 52. This is not intended to be an exhaustive list, as there is a wealth of entertainment and information 
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technology that can be coupled to the audio/video control 20. In Figure 4 this other equipment is shown diagrammati- 
cally as other media 66. These media are preferably connected by conventional cabling 68 to the audio/video control 
20. The audio/video control thus operates as the audio/video signal switching and processing center for the system. For 
example, if the user has selected the VCR 46 as the source of program content, the audio and video signals from the 
5 VCR are switched through audio/video control 20 and communicated through port 62 to display 64. In this regard, the 
audio/video control 20 is preferably capable of handling multiple tasks concurrently. Thus the laser disc player 48 may^ 
be selected as the current source of program material for presentation on display 64, while VCR 46 is taping a television 
broadcast for later viewing. The audio/video control may include a television tuner to supply the necessary audio and 
video signals to the VCR. 

ro Whereas audio and video signal flow is routed between components using cabling 68. the control functions can be 
provided via an alternate link such as an infrared link. In Figure 4 an infrared transponder 70 provides this function. The 
audio/video control 20 sends a command to transponder 70 and the transponder broadcasts that command to each of 
the components in the system. The infrared command includes a device header indicating which of the components 
should respond to the command. In one embodiment, the infrared link is bidirectional, allowing components such as the 

75 VCR 46 or multimedia computer 52, to send infrared replies back to the audio/video control 20. However, the infrared 
link may also be unidirectional, as with current remote controls. There are, of course, other ways of communicating con- 
trol signals between the various components and the audio/video control 20. Infrared has the advantage of being com- 
patible with existing home entertainment equipment. By using infrared control, the audio/video control 20 is able to 
control the operation of home entertainment components that were designed before the advent of the present technol- 

20 ogy. Alternatively, the individual component may have infrared networking capabilities so that the remote control 24 can 
communicate directly with the components without having to go through the audio/video control 20. Thus the video user 
environment of the invention can be incorporated into existing systems, working with most of the user's existing equip- 
ment. 

The remote control 24 and control unit 20 preferably employ a form of distributed processing, in which each unit 

25 includes a processor that works in concert with the other. In Figure 4 this distributed architecture is depicted diagram- 
matically by processor 72, shown as being shared by or related to both the remote control 24 and the control unit 20. 
Although distributed processing represents the preferred implementation, the video user environment could be imple- 
mented by a system in which all of the processing power is concentrated in one of the remote control or control unit 
devices alone. For example, the remote control 24 could be constructed with minimal processing power and configured 

30 to simply relay all hand-drawn instructions of the user to the control unit 20 for interpretation. Such a configuration would 
require a higher data transfer rate between the remote control 24 and control unit 20. An alternate embodiment places 
processing power in the remote control 24, so that user-entered, hand-drawn instructions are interpreted in the remote 
control unit, with higher level instructional data being sent to the control unit 20 for further processing. 

Figure 5 shows the hardware architecture of the preferred implementation. The components of the remote control 

35 unit 24 and the audio/video control unit 20 are shown in the dotted line boxes numbered 24 and 20, respectively. The 
remote control unit includes a processor 72a having local random access memory or RAM 74 as well as read only 
memory or ROM 76. While these functions are shown separately on the block diagram, processor 72a, RAM 74, ROM 
76 and various other functions could be implemented on a single, highly integrated circuit using present fabrication 
technology. Coupled to the processor 72a is an infrared interface 78. The remote control unit 24 may optionally include 

40 a push-button display 77 which provides visual feedback via various light functions and a push-button keypad 79 for pro- 
viding input to control unit 20. Push-button keypad 79 could have preprogrammed functions or may be programmed by 
the user, including a learning function which would allow keypad 79 to take on universal functions. Remote control 24 
may also be provided with a microphone interface 81 for receiving spoken commands from the user. One skilled in the 
art will appreciate that processor 72a or 72b may implement well-known voice processing technology for interpreting 

45 spoken commands into computer instructions. The remote control unit 24 also includes a digitizing writing surface com- 
prising tablet interface 80 and tablet 82. The tablet interface 80 decodes the user-entered, hand-drawn instructions, 
converting them into positional or spatial data (x,y data). Processor 72a includes an internal clock such that each x,y 
data value is associated with a time value, producing a record of the position of the pen or stylus as it is drawn across 
tablet 82. This space/time data represents the hand-drawn instructions in terms of the "ink" data type. The ink data type 

50 is a defined data type having both spatial and temporal components (x p y,t). The ink data type is described more fully 
below. 

The audio/video control unit 20 also includes a processor 72b having associated RAM 86 and ROM 88. Processor 
72b is also provided with an infrared interface 90. Infrared interface 90 communicates unidirectionally or bidirectionally 
(depending on the embodiment) with infrared interface 78 of the remote control 24. In addition to the infrared interface, 
55 processor 72b also includes video interface circuitry 92 that supplies the appropriate video signal to the video out port 
62. 

Much of the video user environment is preferably implemented as software that is executed by the distributed proc- 
essor architecture 72 (e.g. 72a and 72b). The architecture of this software is depicted in Figure 6. The software can be 
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stored in the read only memories ROM 76 and ROM 88 of the remote control unit 24 and control unit 20, respectively. 
Alternatively, the software could also be downloaded to random access memories RAM 74 and RAM 86 over various 
transmission media, including but not limited to, standard telephone lines, fiber optic cable or the television cable that 
also delivers the video signals. 

5 Referring to Figure 6, the software component of the invention is depicted diagrammatically at 100. As illustrated, 

the software component is situated between the user 102 and the hardware 104. The software provides each of the 
functions depicted generally at 106. 

The software component 100 has been illustrated here as the concatenation of several layers. At the lowest layer, 
closest to the hardware 104, is the hardware abstraction layer 108. This layer provides the connection to the actual 

w hardware 104. The hardware abstraction layer handles hardware-related issues such as implementing timers, tuning 
television tuners, supporting video and graphics adapter hardware, providing security functions and operating periph- 
erals. The hardware abstraction layer would, for example, include the necessary device driver for the tablet interface 80. 

One level above the hardware abstraction layer is the microkernel layer 110. The microkernel layer serves as the 
real time operating system for the video user environment. The real time operating system employs drivers and librar- 

15 ies, illustrated in layer 1 12, to produce the higher level input, video and network management functions. The user inter- 
face layer 114 is supported by the underlying layers 108, 110 and 112 Applications such as electronic program guide, 
video player and multiuser games, are run within the user interface layer 114. An exemplary application is illustrated at 
116. 

20 Preferred Video User Interface 

The preferred video user interface, generated by user interface layer 114, is shown in Figures 7-14. 
Referring to Figure 7, the preferred video user interface presents a command bar 32 preferably at a predetermined 
location such as at the lower edge of the screen. The command bar provides access to various functions; the preferred 
25 command bar provides eight buttons for accessing those functions whose names appear on the buttons. Normally there 
is no indication that the video user environment is running on a particular video display device or television. During nor- 
mal viewing operation the video picture fills the entire screen and the command bar 32 is not present. When the user 
wants to access the video user environment functionality, the user requests the command bar 32 by tapping the pen 
once anywhere on the digitizing tablet or pressing a button on the remote control unit 24 to make command bar 32 
30 appear on the screen. Another tap of the pen or press of the button causes the command bar to disappear. 

Anyone can walk up to a television equipped with the present invention and start using it immediately. However, 
much of the power of the video user environment comes from the ability to create personal annotations. For example, 
a user might draw a short descriptive pictogram to mark a favorite channel. 

Before such personalized data can be made available the user must identify himself or herself to the system. This 
35 is accomplished by selecting the "Sign In" button on the command bar by tapping it once. This brings up a panel shown 
in Figure 8 through which the user may sign in. The panel comprises a user list 120 on which two types of information 
are displayed: a text string 122 and an associated ink region 124. The identity of each user is symbolized by the text 
string and its associate ink region. As illustrated, the ink region may not necessarily duplicate the text. In Figure 8 the 
text string JZ identifies the user who has signed her name as "Sophie" in the ink region. The ink region is entirely uncon- 
40 strained: H can be a picture, a doodle, a signature, a word written in any language and so forth. There is explicit binding 
between the ink region and the text string, such that the bound pair is understood by both the system and the user as 
identifying a single individual. The linking of the ink region and the text string forms a data structure often referred to as 
a tuple. This same paradigm carries through a number of the video user environment applications to be discussed. 
Once the Sign In panel is on screen the user may select an ID by tapping on it. Tapping the "Do ft!" button com- 
45 pletes the action, logging in the user as the indicated ID. Alternately, the user may search for a specific ID using a 
searching feature of the invention discussed below. The searching feature uses an approximate ink matching technique, 
thus the user does not need to sign in precisely the same way each time. The system is flexible enough to accommo- 
date normal handwriting variations. 

The Sign In panel also offers the option of adding, deleting or editing a user ID. These operations are modal, mean- 
50 ing that they apply to a specific ID instance. Thus the "Edit" button is only active when an ID is selected. 

The system is capable of performing the approximate ink matching search on a user entered hand-drawn annota- 
tion. By tapping on the Search button 126 a search dialog box 128 is presented as illustrated in Figure 9. The user 
enters a hand-drawn entry or annotation in the ink region 130 and this entry is compared with the ink data previously 
stored as user IDs. The approximate ink matching system of the invention identifies the best match and highlights it in 
55 the user list 1 20 as shown. If the user determines that the highlighted entry is not correct, the user may proceed to the 
next best match by typing the "Find" button 132 again. The process can be repeated until the desired ID is found. 

As an alternate searching technique, the user can search for the ID based on the entry in the text string region 122. 
This is done by typing the desired text string using a soft keyboard brought up by tapping on the keyboard icon 134. The 
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keyboard icon preferably appears as a standard QWERTY keyboard resembling a conventional keyboard found on a 
personal computer. When the keyboard is used to enter a text string, the system finds an exact match in the list of IDs 
by searching for the character string entered by the user. Like the ink search, the text matching search can also be 
approximate. Thus if the user enters the query "ddl" the text string "dpr would be considered a better match than the 
5 text string "jeff." 

After the user has signed in with the user list screen, a briefly displayed confirmatory screen is projected showing 
the text and ink data representing the ID through which the user has signed in. Also, if desired, the time of day may also 
be momentarily displayed. After the confirmatory screen has been displayed for a suitable length of time (e.g. five sec- 
onds) it disappears, leaving only the current video screen visible. In the event the user chooses not to sign in, the sys- 

10 tern assumes that the last entered user ID is applicable by default. 

The video user environment of the invention provides a full complement of standard television controls such as vol- 
ume, balance, brightness, color and so forth. In addition, an on-screen keypad is available for changing channels by 
direct entry of the numeric channel number or by "surfing" up and down the dial by clicking suitable up and down but- 
tons. The standard television controls are presented by tapping the TV button 136 on command bar 32. 

is The presently preferred implementation continues to use the traditional remote control push buttons for performing 
standard television control functions such as those listed above. For continuity and maximum flexibility, these same 
functions are duplicated on screen through the video user interface. 

Although the video user interface provides the same ability to control standard television control functions as the 
traditional remote control, the video user interface of the invention goes far beyond the traditional remote control. The 

20 invention provides sophisticated tools to help the user manage his or her video programming. Figure 10 shows the tel- 
evision control panel 138 that is displayed when the TV control button 136 is tapped. The numeric keypad 140 is used 
to enter television channels directly and the up and down buttons 142 sequentially surf through the channels in forward 
and backward directions. By tapping on the channel list button 144 brings up a scrollable list of channels with handwrit- 
ten annotations as illustrated in Figure 1 1 . As with the sign in panel, it is possible for the user to select an item manually 

25 or search for an item using the approximate ink or text matching techniques. In this case, the numeric pad 140 
(accessed by tapping on the appropriate numeral icons) limits the user to numeric input (i.e. TV channels). Tapping on 
the "Schedule" button 146 displays a convenient television schedule illustrated in Figure 12. The preferred implemen- 
tation portrays the TV schedule in the form of a traditional paper-based television guide. It has the distinct advantage, 
however, of knowing what time it is. Thus, the TV schedule screen (Figure 12) highlights programs currently playing, to 

30 assist the user in making a choice. Thus the TV schedule of Figure 12 is an active schedule capable of highlighting 
which are current programs, updating the display in real time. In Figure 12 the active programs are designated by dotted 
lines at 148 to indicate highlighting. The present invention carries the concept of active scheduling one step further, 
however. Each program in the display is tagged with a predefined icon indicating its genre. Thus news, sports, drama, 
comedy, kids and miscellaneous may be designated. The user may limit the TV schedule to display only those pro- 

35 grams in certain genres by tapping the "Clear All" button 150 and by then activating one or more of the check boxes in 
the category pallet 152. In the example shown in Figure 13, the user has elected to limit the display of programs in the 
sports, comedy and kids categories. This feature in the video user environment makes it much easier for the user to 
identify which programs he or she wants to watch. 

Finally, the TV schedule allows the user to program the TV to change channels at specific times automatically. Thus 

40 the user does not miss an important show. Unlike programming of current VCRs, which can be complicated and frus- 
trating, programming in the video user environment is handled in a highly intuitive way. The user simply taps on a show 
displayed in the schedule (such as "World Series" in Figure 13). thereby highlighting it. Then, at the appropriate time, 
the video user environment switches to the proper channel (in this case channel 2). As with all video user environment 
applications, ease of use is key 

45 The foregoing has described how the video user environment may be used to access and control television. Similar 
capability is provided for other audio and video components such as the VCR. Figure 14 depicts the VCR control panel 
154 that is displayed when the VCR button 156 is tapped. The VCR control panel provides traditional play, stop, pause, 
rewind and fast forward control. In addition, if the VCR equipment is capable of such functionality, the VCR tape can be 
indexed forward or backward on a frame-by-frame basis. Similar capabilities can be provided for controlling laser disc 

so players, for example. 

As best illustrated in Figure 14, tapping the "Program" button 158 calls up a display visually identical to the TV 
schedule display of Figure 1 2. However, the TV schedule and the VCR schedule are maintained as separate data struc- 
tures, so that the user may program the TV and VCR independently. Using the same visual displays for different but 
comparable functions is one way the presently preferred implementation makes the system easier to use. By reusing 
55 the same icons and tools (including the same window layouts, locations and function of buttons) speeds the learning 
process, as the user only needs to have experience with one instance of the tool to know how to apply it in its other 
settings. This also makes the video user environment application smaller, as code can be shared among several func- 
tions. 
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Tapping on the "Library" button 160 (Figure 14) brings up yet another browser displaying text and ink annotations 
in pairs. Similar in appearance to the channel list of Figure 11 , the video library displays entries that correspond to spe- 
cific video programs that the user can view at will. Thus the video library can serve as an interface to a video on demand 
system or to recordings in the user's own personal collection. For example, the user might enter "Nightly News" in the 
video library, keying it to a particular video on demand selection. Alternatively, the user may call up a memorable sport- 
ing event such as "Bob's Favorite Yankee Game." Thus the user could later search through the entries in the video 
library and select an archived event by tapping on it. This would in turn cause the video on demand system to com- 
mence delivery of the news or other entertainment program to the user. As video on demand systems become more 
sophisticated, this capability can be quite valuable. For example, the user might wish to use the video library to review 
nightly news programs for the week he or she was on vacation and unable to watch the news. Or, the user might wish 
to use this video library to call up previous sporting events from the video on demand system. 

Tapping the "Games" button 162 (Figure 14) brings up a window (Figure 15) that provides a quick and easy inter- 
face for a user (even a child) to access a variety of on-line games. Some of these games may involve other players on 
a network. The presently preferred embodiment of the video user environment does not directly implement any of these 
games, as it is contemplated that such games would be supplied by commercial software developers. The preferred 
interactive games interface simply displays a plurality of icons to represent each of the available games on the user's 
system. 

Tapping on the "Shopping" button 164 calls up a display of home shopping options (Figure 16). Preferably each 
option is displayed as a separate icon that the user may tap on in order to access those shopping services. If desired, 
the shopping button could call up a web site on the Internet that could be used as a starting point for supplying hypertext 
links to other shopping locations. 

Tapping on the "l-Mail" button 166 (ink-mail) provides the user with an electronic mail communication system. In 
contrast with conventional E-mail systems that rely on keyboard-entered text, the video user environment allows the 
user to send hand-drawn or handwritten messages. The l-mail interface (Figure 17) preferably provides a notepad area 
into which the user can draw handwritten messages that may then be sent via the Internet or other suitable communi- 
cation network to a recipient. These handwritten messages allow for more personalized correspondence and are more 
accessible than typed electronic mail. Additionally, writing with a pen is more powerful. For example, a user can begin 
writing an l-mail text message and then switch to drawing a map without changing tools as is required with current key- 
board/mouse-based electronic mail systems. 

As discussed above, the video user environment has access to a system dock whereby the TV schedule and VCR 
schedule are made active. The clock button 168 (Figure 14) may be tapped to call up a screen in which the user can 
set the correct date and time of day of the system. 

Preferred Ink Search and Retrieval Technology 

The preferred embodiment uses an approximate matching procedure to identify and rank possible hand-drawn 
"ink" entries made by the user using the digitizing tablet and pen. The approximate matching procedure is a fuzzy 
search procedure that identifies and ranks possible substring match candidates based on a scoring and ranking dis- 
tance between the query and the candidate. The procedure produces a score for each candidate, allowing the candi- 
dates to be ranked in order of "goodness." 

One benefit of the approximate matching procedure is that any line breaks in the user-drawn entry or query have 
no impact on the ink search. Line breaks in writing are ignored, so that the user does not have to remember where the 
line breaks may have occurred in the original entry. 

The fuzzy search technique of the preferred embodiment uses a vector quantized (VQ) representation of the user- 
drawn entry to capture arid compare pen strokes of the ink data type. The ink data type is a system defined data type 
that captures the precise (X,Y) position of the pen tip over time as the user writes or draws an annotation or entry. Thus 
the ink data type captures not only the spatial position of the ink, but also the temporal sequence over which the ink is 
"applied" as the user draws the entry on the digitizing writing surface. Figure 18 gives an overview of the manner in 
which pen stroke dasstf ication is performed using vector quantization. The ink data type records the motion of the pen 
tip over the surface of the digitizing tablet as a string of (X.Y) ink points. The individual (X,Y) ink points are sequentially 
captured, thereby preserving the temporal or time-based component of the data. Thus the ink data type may be consid- 
ered as comprising (X.Y.T) vectors. 

As illustrated in Figure 18, the incoming ink data 200 are broken into strokes as at 202. Segmenting the ink data 
into strokes allows each stroke to be analyzed separately. By way of illustration, Figure 18 shows that the plus sign (+) 
in the incoming data 200 was drawn by the user, first forming a horizontal line and then forming a vertical line. This is 
illustrated at 202 by reading the segmented data at 202 from left to right. 

After stroke segmentation the individual strokes are then analyzed to extract feature vectors- This is shown dia- 
grammaticaliy at 204. In Figure 18, the extracted feature vectors are shown graphically to simplify the presentation. In 
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the actual embodiment, the extracted feature vectors are represented as numerical data that is stored in the computer 
As indicated at 206, each extracted feature vector is classified according to a predetermined code book 21 0. The pres- 
ently preferred embodiment stores 64 clusters of stroke types, each cluster being represented by its centroid or average 
stroke of that type. As in the case of the extracted feature vectors (block 204) the feature vector clusters are stored as 

5 numerical computer data. In Figure 18 the data comprising code book 210 are shown graphically (instead of numeri- 
cally) to simplify the presentation. In Figure 18 note that the horizontal line segment of block 206 most closely matches 
the centroid 21 2 of the Type 2 stroke cluster 214. Thus in the output string (block 21 6) the VQ code 2 is used to repre- 
sent the horizontal line in block 206. In block 216 the leftmost numeral 2 corresponds to the leftmost horizontal line 
stroke. The remaining codes represent the remaining ink strokes comprising the original incoming ink data. 

10 Through the above-described procedure the incoming ink data is converted, pen stroke by pen stroke, into a feature 
vector that corresponds to each individual pen stroke. The set of feature vectors which collectively represent a series of 
pen strokes are stored in the computer database as the user-drawn annotation. This is depicted at 218. 

To further illustrate, a software block diagram of the presently preferred embodiment is shown in Figure 19. The 
annotation system operates on digitized pen stroke data that is ultimately represented as an "ink" data type. As will be 

15 illustrated, it is not necessary to convert the ink data type into an ASCII character data type in order to perform the 
search and retrieval procedures. Indeed, in the case of graphical (nontext) annotations, conversion to ASCII would have 
no meaning. Thus, a significant advantage is that the annotation system operates in a manner which allows the "ink" 
data to be language-independent. 

Illustrated in Figure 19, the user-drawn query 300 is captured as a string of (X,Y) ink points, corresponding to the 

20 motion of the pen tip over the surface of the digitizing tablet or pad as the user draws query 300. The presently preferred 
embodiment digitizes this information by sampling the output of the digitizing pad at a predetermined sampling rate. 
Although a fixed sampling rate is presently preferred, the invention can be implemented using a variable sampling rate, 
as well. By virtue of the digitized capture of the X,Y position data, both spatial and temporal components of the user- 
drawn pen strokes are captured. The temporal component may be implicit information - the ordering of sampled points 

25 relative to one another conveys temporal information. Alternatively, the temporal component may be explicit - the exact 
time each point was sampled is captured from an external dock 

In the presently preferred embodiment, employing a fixed sampling rate, each X,Y data point is associated with a 
different sampling time. Because the sampling rate is fixed, it is not necessary to store the sampling time in order to 
store the temporal data associated with the pen stroke. Simply recording the X.Y position data as a sequence automat- 

30 ically stores the temporal data, as each point in the sequence is known to occur at the next succeeding sampling time. 
In the alternative, if a variable sampling rate system is implemented, (X,Y,T) data is captured and stored. These 
data are the (X,Y) ink points and the corresponding time T at which each ink point is captured. 

The raw ink point data is stored in data store 302. Next, a segmentation process 304 is performed on the stored ink 
point data 302. The presently preferred segmentation process searches the ink point data 302 for Y-minima. That is, the 

35 segmentation process 304 detects those local points at which the Y value coordinate is at a local minimum. In hand- 
drawing the letter "V" as a single continuous stroke, the lowermost point of the letter "V" would represent a Y-minima 
value. 

Segmentation is performed to break the raw ink point data into more manageable subsets. Segmentation is also 
important for minimizing the variation in the way the users produce ligatures; the connection of characters or even 
40 words. These segment subsets may be designated using suitable pointers to indicate the memory locations at which 
the Y-minima occur. In this case, these segmentation pointers may be stored at 306 to be associated with the ink point 
data 302 previously captured. In the alternative, if desired, the segmented data may be separately stored in one or more 
memory buffers instead of using pointers. 

price the raw data has been segmented the individual segments or pen strokes are operated on by a set of extrac- 
ts tion functions 308. The presently preferred embodiment operates on the pen stroke (segment) data using 1 3 different 
extraction functions. These extraction functions each extract a different feature of the pen stroke data that are then used 
to construct a feature vector. Table I lists the presently preferred features that are extracted by the extraction functions 
308. For further background information on these extraction functions, see Rubine, Dean, "Specifying Gestures by 
Example," Computer Graphics, Vol. 25, No. 4, July 1991. The feature vectors of a given stroke are diagrammatically 
50 represented in Figure 19 at 310. 
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Where P represents the total number of points. 

The extracted feature vectors represented at 31 0 are then coded or quantized by comparison with a predetermined 
set of clusters of stroke data types. The feature vector data 310 is quantized by vector quantization process 312 to 
assign each cluster to the closest predetermined stroke type. In this regard, the presently preferred embodiment 
defines 64 different stroke types that are each represented by a different name or number. Although the presently pre- 
ferred system uses 64 different stroke types, the principles of the invention can be employed with a greater or fewer 
number of stroke types. 

The predetermined stroke types are arrived at during a training procedure 313. The training procedure may be 
used to predetermine a vector quantization (VQ) code book 314 that is then used for multiple users. In many commer- 
cial implementations it will be desirable to train the system at the factory, using a set of user-independent training data. 
Alternatively, the training procedure can be used prior to use by an individual user. Both applications work well. In either 
case, the system is still user-dependent because there can be a great deal of variation in the way two different people 
draw the same annotation. Thus the preferred embodiment is best suited to searching one's own annotations. 
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It will be appreciated that in most cases the user will not draw the same annotation in precisely the same way each 
and every time. That is, the (X,Y,T) coordinates and temporal properties of a given annotation may vary somewhat, 
each time the user draws that annotation. The presently preferred system accommodates this variation first by the man- 
ner in which the vector quantization is performed. Specifically, the vector quantization process 312 assigns each input 
stroke to the predetermined vector 315 from the user-dependent stroke types 314 that represents the closest match. 

After each of the strokes representing the query has been processed in this fashion, a comparison is made 
between those strokes and the user-drawn annotations that have been stored in association with the documents in the 
database 320. Thus, for example, the query "important" may be compared against the stored annotation "this is very 
important!" An edit distance analysis is performed to make this comparison. 

Shown as edit distance analysis process 318, the query stroke type string is compared with each of the stored 
annotation stroke type strings 321 of the database 320. The edit distance analysis compares each stroke type value in 
the query string with each stroke type value in each of the annotation strings. A edit distance computation is performed 
by this comparison, yielding the "cost" of transforming (or editing) one string into the other. The individual string/string 
comparisons are then ranked according to cost, with the least cost resultants presented first. In this way, a sorted list 
comprising all or the n-best matches is displayed in the thumbnail sketches of the main browser screen. Alternatively, 
rather than showing a sorted list, the user may be shown the best match on the main browser screen. If the user deter- 
mines that this match is not correct, the user may tap the "Next" button (not shown) to see the next best match. 

Figure 20 shows the basic edit distance technique. In this case, the stored annotation "compress" is compared with 
the query string "compass." It should be understood that Figure 20 depicts the comparison of two strings as a compar- 
ison of individual letters in two differently spelled words. This depiction is intended primarily to aid in understanding the 
edit distance computation technique and not necessarily as a depiction of what two stroke type strings might actually 
look like. In this regard, each of the 64 different stroke types may be arbitrarily assigned different numerical labels. Thus 
the edit distance computation would compare the respective numeric labels of the stored annotation and the input 
query directly with each other. There is no need to convert the individualstrings into ASCII characters and Figure 20 is 
not intended to imply that such conversion is necessary. 

Referring to Figure 20, each time the annotation string stroke value matches the query string stroke value a cost of 
zero is assigned. Thus in Figure 20. a zero cost is entered for the comparison of the first four string values "comp." To 
accommodate the possibility that a string/string comparison may involve insertion, deletion or substitution of values, a 
cost is assigned each time an insertion, deletion or substitution must be made during the comparison sequence. In the 
example of Figure 20, the query string "compass" requires insertion of an additional value "r" after the value "p." A cost 
of one is assigned (as indicated at the entry designated 422). Continuing with the comparison, a substitution occurs 
between the value "e" of the stored annotation string and the value "a" of the query string. This results in an additional 
cost assignment of one being added to the previous cost assignment, resulting in a total cost of two, represented in Fig- 
ure 20 at 424. Aside from these insertion and substitution operations, the remainder of the comparisons match, value 
for value. Thus, the final "cost" in comparing the annotation string with the query string is two. represented in Figure 20 
at 426. ^ 

In the preceding discussion, a first minimum cost path was described in which "compass" is edited into "compress" 
by inserting an "r" and substituting an "e" for an "a." An alternative edit would be to substitute an "r" for an "a" and insert- 
ing an "e." Both of these paths have the same cost, namely two. 

Figure 21 gives another example of the edit distance computation technique. As before, strings of alphabetic char- 
acters are compared for demonstration purposes. As previously noted, this is done for convenience, to simplify the illus- 
tration, and should not be interpreted as implying that the strings must be first converted to alphanumeric text before 
the comparisons are made. Rather, the procedure illustrated in Figures 20 and 21 are performed on the respective 
stroke data (vector quantized symbols) of the respective stored annotation and input query strings. 

Figure 21 specifically illustrates the technique that may be used to perform an approximate match (word spotting). 
In Figure 21 the stored annotation "This is compression," is compared with the query string "compass." Note how the 
matched region 430 is extracted from the full string of the stored annotation by scanning the last row of the table to find 
the indices that represent the lowest value. Note that the first (initializing) row in Figure 21 is all 0s - this allows the 
approximate matching procedure to start anywhere along the database string. 

The presently preferred edit distance procedure is enhanced over the conventional procedures described in the lit- 
erature. In addition to the three basic editing operations (delete a character, insert a character, and substitute one char- 
acter for another), it is useful to add two new operations when comparing pen stroke sequences. These new operations 
are "split" (substitute two strokes for one stroke) and "merge" (substitute one stroke for two strokes). These additional 
operations allow for errors made in stroke segmentation and generally leads to more accurate results. 

The use of our enhanced edit distance procedure is illustrated in Figure 21 . In Figure 21 the split operation is used 
to substitute the letters "re" in "compress" for the letter "a" in "compass." Note that the backtracking arrow in Figure 21 
spans one row but two columns, thereby signifying the multicharacter (merge) substitution. Hence the edit distance is 
one, not two, in this case. By way of comparison, Figure 20 illustrates the basic edit distance algorithm without utilizing 
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the two new multicharacter operations. Thus the cost (as depicted in Figure 20) of editing "compass" into "compress" 
is two. 

The above-described procedure works well in most user-drawn annotation applications. The combined use of vec- 
tor quantizing and edit distance computation yield a system that is remarkably robust in its ability to find matching 
strings and substrings, even if they are not drawn precisely the same way by the user. Although the presently preferred 
embodiment has been illustrated here, a number of variations are possible without departing from the spirit of the inven- 
tion. For example, if a faster match is desired, the system may perform an initial "first pass" match by simply finding all 
strokes that have a similar number of data points. This may be done by storing the number of data points as part of the 
feature data and then simply selecting or excluding those strokes that are not within a predetermined data point count. 
This type of first pass search can be performed quite quickly, as simple numeric matching algorithms are all that are 
required. The first pass technique based on data point count would not, however, allow matching substrings to be 
extracted as the edit distance computation permits. Where higher matching accuracy is desired a more computationally 
costly matching technique such as a Hidden Markov Model technique may be used as a final pass on the n-best hypoth- 
eses determined by the edit distance computation. Adding a highly accurate, but computationally costly processing 
stage to the final output may be used in systems where it is necessary to discriminate between a large number of highly 
similar strings. 

In summary, there is disclosed a system where the user 102 communicates through a digitizing writing surface 26 
with the audio/video control apparatus 20. An on-screen display 32, 34 is generated, providing the user 102 with a user 
environment in which a wide range of different tasks and functions can be performed. The digitizing writing surface 26 
can be incorporated into a hand-held remote control unit 24 and the audio/video control apparatus 20 may likewise be 
incorporated into existing home entertainment or computer equipment By tapping on the writing surface 26 a command 
bar 32 is presented on the screen, allowing the user 1 02 to select among various functions. Included in these functions 
is an on-screen programming feature 158, 148, allowing the user to select programs for viewing or recording by entry 
of user-drawn annotations or commands via the writing surface 26. 

The foregoing discussion discloses and describes exemplary embodiments of the present invention. One skilled in 
the art will readily recognize from such discussion and from the accompany drawings and claims, that various changes, 
modifications and variations can be made therein without departing from the spirit and scope of the invention as defined 
in the following claims. 

Claims 

1. An audio/video system having an enhanced video user environment, characterized by: 

an audioA/ideo control apparatus (20) for selectively performing predetermined audio/video control functions 
(106) in accordance with a user's (102) selection, said control apparatus (20) including a port (62) for coupling 
to a video display apparatus (64; 36; 22) for displaying video material; 

a remote control apparatus (24) having a digitizing writing surface (26) for entry of hand-drawn instructions by 
a user (102), said remote control apparatus (24) communicating with said audio/video control apparatus (20); 
a processor (72) communicating with at least one of said audio/video control apparatus (20) and said remote 
control apparatus (24) for controlling operation of said video display apparatus (64; 36; 22) in accordance with 
said hand-drawn instructions. 

2. The system of claim 1 , characterized in that said remote control apparatus (24) comprises a hand-held push-button 
remote control structure (54-58) with said digitizing writing surface (26) incorporated into said structure (54-58). 

3. The system of claim 1 or 2, characterized in that said remote control apparatus (24) communicates with said 
audio/video control apparatus (20) by infrared signals (30). 

4. The system of any of claims 1-3, characterized in that said remote control apparatus (24) communicates bidirec- 
tional ly with said audio/video control apparatus (20). 

5. The system of any of claims 1-4, characterized in that said remote control apparatus (24) includes a microphone 
for input of speech instructions. 

6. The system of any of claims 1 - 5, characterized in that said digitizing writing surface (26) is responsive to a hand- 
held stylus (28). 

7. The system of any of claims 1 - 6, characterized in that said digitizing writing surface (26) is responsive to the user's 
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fingertip. 

8. The system of any of claims 1 - 7, characterized in that said audio/video control apparatus (20) includes at least 
one control port for coupling to at least one component (36-52, 66) of audio/video equipment and wherein said 
audio/video control apparatus (20) includes a control module for issuing (68; 70) control signals through said con- 
trol port to said component (36-52; 66) of audio/video equipment. 

9. The system of claim 8, characterized in that said component (36-52; 66) of audio/video equipment is a component 
selected from the group consisting of television (36; 22), video cassette recorder (VCR) (46), audio tape recorder 
(44), audio disc player (48). video disc player (48), audio amplifier (42), surround sound processor (38, 40), video 
signal processor, camcorder (50), video telephone, cable television signal selector, satellite antenna controller, 
computer (52), CD-ROM player, photo CD player, video game player and information network access device. 

10. The system of any of claims 1 - 9, characterized in that said processor (72b) is disposed in said audio/video control 
apparatus (20). 

11. The system of any of claims 1 - 9, characterized in that said processor is attached to said audio/video control appa- 
ratus (20), 

1 2. The system of any of claims 1 - 9, characterized in that said processor (72a) is disposed in said remote control 
apparatus (24). 

13. The system of any of claims 1 - 9, characterized in that said processor (72) comprises a multiprocessor system 
(72a, 72b) having a first portion (72b) disposed in said audio/video control apparatus (20) and having a second por- 
tion (72a) disposed in said remote control (24). 

14. The system of any of claims 1 - 14, characterized in that said audio/video control apparatus (20) includes an inte- 
grated television tuner for tuning a user selected channel carrying program information and providing a video signal 
representing said program information to said video display apparatus (64; 36; 22). 

15. The system of any of claims 1 - 14, characterized in that said video display apparatus (64; 36; 22) is a television 
(36; 22) and wherein said audio/video control apparatus (20) outputs a video signal through said port, preferably 
an NTSC or PAL or HDTV signal. 

16. The system of any of claims 1-15, characterized in that said audio/video control apparatus (20) is incorporated 
into a component of audio/video equipment. 

17. The system of claim 16, characterized in that said component of audio/video equipment is a component selected 
from the group consisting of television (36; 22), video cassette recorder (VCR) (46); audio tape recorder (44), audio 
disc player (48), video disc player (48), audio amplifier (42), surround sound processor (38, 40), video signal proc- 
essor, camcorder (50), video telephone, cable television signal selector, satellite antenna controller, computer (52), 
CD-ROM player, photo CD player, video game player and information network access device. 

18. The system of any of claims 1 - 17, characterized in that said processor (72) includes a speech recognizer module. 

19. The system of any of claims 1 - 18, characterized in that said processor (72) generates at least one menu (32, 34) 
of user selectable system control options (136, 156, 160-168) and said audio/video control apparatus (20) issues 
a signal through said port (62) to display said menu (32, 34) on said video display apparatus (64; 36; 22) coupled 
to said port (62), wherein, in case said processor (72) is a multiprocessor system (72a, 72b), at least one processor 
of said multiprocessor system (72a, 72b) generates said at least one menu (32, 34). 

20. The system of any of claims 1-19, characterized in that said processor (72) is coupled to memory means (74, 76, 
86, 88) for storing user input. 

21 . The system of claim 20, characterized in that said user input comprises handwritten annotations drawn on said dig- 
itizing writing surface (26). 

22. The system of claim 21 , characterized by an on-demand video interface (158, 1 48) whereby said handwritten anno- 
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tations are used to recall a prerecorded entertainment program for presentation on said video display apparatus 
(64; 36; 22). 

23. The system of claim 21 or 22, characterized in that said handwritten annotations are translated into a known com- 
puter character set for subsequent processing. 

24. The system of any of claims 1 - 23, characterized in that 

said digitizing writing surface (26) is a digitizing writing display surface for entry of said hand-drawn instructions 
by said user and for displaying information to said user; and 

said processor (72) is a multiprocessor system (72a, 72b) having a first portion (72b) disposed in said 
audio/video control apparatus (20) and having a second portion (72a) disposed in said remote control (24), 
said multiprocessor system (72a, 72b) communicating between said audio/video control apparatus (20) and 
said remote control apparatus (24) for controlling operation of said video display apparatus (64; 36; 22) in 
accordance with said hand-drawn instructions. 



14 



EP 0 838 945 A2 




EP 0 838 945 A2 




EPO 838 945 A2 




EP 0 838 945 A2 




Figure 4 



EP0 838 945 A2 



24 ^ 



RAM 



72a 



2. 



ROM 



-74 



-76 



Processor 



IR Interface 




77 




71 



.80 



.78 



M 
I 

C 





Tablet 




Tablet 




Interface 





82 



IR Interface 



Processor 



86 



.90 



92, 



Video 



-72b 



L 



88 









RAM 




ROM 



Figure 5 




20 



Video Out 




62 



19 



EP 0 838 945 A2 




102 



114. 



112. 



100 



110. 



108 



A 

V 


116 


v User Interface y 


/ 


Applications ' 


>- 




^ Drivers 
and Libraries 





>- 


Microkernel 


~>- 


, Hardware 
Abstraction Layer 






106 



• Electronic Program Guide 

• Video Player 

• Multi-User Games 



y 



Input 
♦ Video 
Network Management 



< 



> 



< 



Small Real-Time 
Operating System 



Timers, Tuning, Video, 
Graphics, Security, 
Peripheral Control 



Figure 6 



20 



EP 0 838 945 A2 




21 



EP0 838 945 A2 




22 



EP0 838 945 A2 




23 



EP 0 838 945 A2 



c 
in 



u 



CO 

co 



o 



o 

LU 

Q 

> 



♦ ♦ ♦ 4 



9-3 1 - -2 
D O ^ ro ra 

> cd m 



m 


CD 


en 






in 


CO 


► 






K 


o 



^ — 




cr 
c 

*a 
a 
o 

CO 
CO 

E 

03 
CO 



U 
> 



to 

CO 



o 



24 



EP 0 838 945 A2 




25 



EP 0 838 945 A2 



CM 

in 



o 
in 



4- 



&»|©i8>|@|<r>; 

00 Q <_> 
Sf S El 0 El £3 



oo 





6> 


@ 


@ 




§> 


8> 




*_ 


2:30 pm 


Trading 
Places (R 


u 

~£ 

SO 


«> 

v> 
i- 3 
5 ° 

O X 


a 

o °= 
X ui 










E 

CL 
O 
O 


1© 

</) 

o 


@ 

to 

X) 


n 

J£ 
c 

s 8 

V) o- 


V) 

O 4> 


TO 




,|> 




i e 

: o 


I© 

T> 
~C 

o 


@ 

11 


T3 
IU 
V 

c ° 


Simpsons 


w 
« u 

" s 

a. u. 


§j 


-7TT 












"01 










E 

Q. 

O 
O 


O.EO 

o 
u 


Entertain. 
Tonight g 


o 

V) 

~o5 

<d 
ce 


*5 

O U 

a> o 


* § 
i o 


O u 
JU U 


t/> 
Cl 

o 




) pm 


CO ^ 
V 




01 


c 
c 
« 




o 


-Q u 

lO u 

o w 




12:3C 






§5 


O <j 

en o- 






° 5 

OJ o 

£6 


T 



u 



Q. 

a. 
o 
_c 

CO 
to 

<L> 
E 

aj 

03 
l_ 



> 



x: 
o 



JC 

u 



.c 



o 
a. 



^ — 



26 



EP0 838 945A2 



l#l©Ig>|@I<S>l 



L 



D 0 D S ® 



2:30 pm 




@ 

o 
= G 


V 

1- 3 

as 










2:00 pm 


World Series 


Friends 
(cc) © 


Seinfeld 










1:30 pm 


s 


tt> 




s © 

s 








1 :00 pm 
















12:30 pm 








c 
c 

CO 

CJ _ 

£^ 






XI u 
U» u 
O w 

° J 

V o 
jz x: 

»- V) 



■e 

-C 
.CO. 



x: x: 



ca 

x: .c 



c 

00 



cn 
c 
'a. 
a 
o 
x: 
tn 

co 
E 



O 

a. 



on 



cn 



27 



EP 0 838 945 A2 




28 



EP0 838 945 A2 




29 



EP 0 838 945 A2 




30 



EP 0 838 945 A2 




31 



EP 0 838 945 A2 



Incoming 
Ink Data 




200 



Figure 18 



T 

Break 
Into Strokes 



-lo 


— \ 


Ex 


ract 



202 



Feature Vectors 



204 




Classify -compare 
Stroke Type / Fea ture Vector 
for Input Stroke 
to All Cluster 
Centroids 



T 

Output String 
of Stroke Types 




216 




Centra id 



Type 1 Strokes 
'Vertical Lines" 



Centroid 



Type 2 Strokes 
"Horizontal Lines" 



Centroid 



Type 3 Strokes 
"circles" 







(27.2, 14, 0.933, ...) 


Ink 




Feature Vector 



210- 



One Pen Stroke 



■218 



32 



EP0 838 945 A2 




33 



EP0 838 945 A2 




CD 



CD 



CO 
CO 

2 

CL 

E 

8 



CO 
CO 
CO 
CL 

E 
o 
o 



0) 

o 



UJ 



CD 

2 o5 



CD 

CD 
CD 

- 3 

0) » 

to -S 

c 3 

*— to 



o 

CN 



c 

.2 

o 
c 
c 

< 

CD 

o 
CO 



oo 



fs O LO AT CO (CM; 



o m co co cm cm 




. — O • — CM CO IQ 



o *— cm co "^r to o 



o o E a o 



Ajano 



Z5 
CD 



c 
O 

o 
c 
c 

< 

CD 
i_ 

o 

CO 



O 



O 



O 



CM CM CO CO XT 



< — « — CM CM CO CO 



• — ■ — CM CM CO CO CO 



» — > — CM CM CO CO CM 



• — • — CM CM CO CM • — 



♦ — » — CM CM CM ' — CM 



r~ r- <N CM i— CN CM 



r- CM r— /r— — CM 



■ — r— CD « — CM CO 



< — « — O « — CM CO "^T 



/ 



/ 



/ 



/ 



i— CD t— CM CO cO ^3" 



O r- (N CM CO CO ^ 



i — i — CM CM CO CO 



i — i — CM CM CO CO CO 



r— i — CM CM CO CO ^sT 



rrr r- CM CM CO CO 



f— r— CM CM CO CO CO 



» — CM CM CO CO ^ 



• — » — CM CM CO lO 



» — < — CM CO ^ m O 



o cm co -^rioor^ 



O O E Q. o 



Ajano 



34 



